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1 


GCGGCCGCGT 


CGACGTCTTT 


GCTGCCGCAC 


AGGGAGCAGC 


AGCAGCCGCC 


GACCCGATCC 


61 


CTTGGGAGCC 


CACCAAGTGC 


TGCGCTTGCT 


TAGCAGCTAC 


AGGAGCTGCC 


GCGGGGTTGC 


121 


TCCCTGAGGC 


AGCGTGCATG 


TATGGTCCGG 


CAGCCAGCTT 


GGTGTCGCAG 


CCGTACTTCT 


181 


TGGAAGCGAG 


AGAGACTGTG 


GGAGAGCGCA 


AATCACTCCA 


GCCGCTTCCA 


GGGGAGTCTG 


241 


GGGACCGCAG 


GAGCGTTGGA 


GGCTGCCTGC 


CGGCATAAAC 


AGGAACAAGC 


GCATTCTTAT 


301 


TCTTCTGTGG . 


TTGCTGAGTT 


CTGGCTGCGT 


TCAAGGGGGT 


TCACCTCTTC 


CCCTTCTGGC 


361 


GAGTTTTTGC 


TGCGTCTTTC 


CCTAAGAAGC 


AGCGCCACGT 


GCGTGGCGTG 


CCTCAGCCTG 


421 


ACGCGGTGCA 


CCTTTTACGT 


AAGAGCGTCG 


ATAGCATCGG 


TCATCTACAG 


CAGCGTGCTG 


481 


CTGCTTCCGT 


GACCTTTACA 


CTGCTTGTGG 


CGGGCCGTCT 


TGTAGAGGGG 


CCATCTGCTT 


541 


GTTCGCTGCT 


GGACGCAGAC 


CCGGCGCCCG 


ACATTTCCGG 


CAGCCGGGCA 


GTTGAGATAA 


601 


ACCGGCTGCC 


CGGTGGCCGT 


CGAAATTGAA 


GCAGGATCTC 


TACAGTAAGG 


AACAAATCGC 


661 


GCTATTTTTA 


AGGAGTGTGT 


ATAGTTGGGG 


CGTTACTCGT 


GAGTATTGCT 


GATGATGGAC 


721 


GTCCGTGTGG 


GGGGTAAGTA 


TCGTTTGGGG 


AGGAAGATTG 


GGAGCGGATC 


CTTCGGCGAC 


781 


ATCTACCTTG 


GTACGAACAT 


CTCAACAGGA 


GATGAAGTCG 


CTATCAAATT 


GGAAAGCGTG 


841 


CGGTCTAGGC 


ATCCACAACT 


AATCTATGAA 


AGCAAGCTGT 


ACAAAATCCT 


AACGGGTGGA 


901 


ATCGGAATCC 


CGACTCTTTA 


CTGGTATGGG 


ATCGAGGGGG 


ATTACAACGT 


TATGATTATT 


961 


GAGCTTTTGG 


GCCCGTCTCT 


TGAGGACCTC 


TTCAGCATTT 


GCAACAGAAA 


GCTTTCTTTG 


1021 


AAGACTGTTC 


TGATGCTCGC 


CGACCAAATG 


CTAAATCGTA 


TTGAGTTCGT 


CCACAGCAGA 


1081 


CATTTCATCC 


ATCGAG&CAT 


CAAGCCTGAC 


AATTTTTTGA 


TCGGTAGGGG 


CAAAAAGATG 


1141 


TCCATTGTTT 


TTGCTATCGA 


CTTTGGCCTC 


GCAAAGAAGT 


ACAGAGATCC 


CAGAACACAG 


1201 


TCCCATATTC 


CTTATCGAGA 


AGGGAAGAAC 


CTGACAGGTA 


CCGCGAGGTA 


CGCCTCTGTG 


1261 


AACACCCACT 


TGGGAATAGA 


ACAGAGCAGG 


CGCGATGATC 


TGGAAGCGCT 


CGGCTACGTC 


1321 


TTAATGTACT 


TCAACAGAGG 


TTCCTTACCC 


TGGCAAGGAT 


TAAAGGCCAC 


TACGAAGAAA 


1381 


GATAAATATG 


ACAAGATTAT 


GGAGAAGAAG 


ATGTCCACCC 


CTATTGAAGT 


CCTTTGCAAA 


1441 


CAATTTCCAT 


TTGAGTTTAT 


CACATATCTG 


AACTATTGCC 


GGTCTCTGCG 


ATTCGAAGAT 


1501 


CGCCCGGACT 


ATTCCTATTT 


GAGACGGTTG 


TTCAAGGATC 


TTTTCTTCCG 


TGAGGGATAC 


1561 


CAGTATGACT 


TTATATTCGA 


TTGGACATTT 


CTGCATGCTG 


AGAGAGAGCG 


CGAGCGTCAA 


162i 


AGACGATCGA 


TGGTCAACCA 


AGGCGCAGAA 


TCAGGGAACC 


AGTGGAGACG 


AGACGCGTCG 


1 CQ1 

lool 








TTAGAACC GT 


AATCTCTTTA 


CGGGCAGATT 


1741 


GCCGTACGGG 


TCTTCTGCTC 


ATTCAGTGGC 


agtgccaccg" 


CAGTGCTATC 


TGAGGCTGTG 


1801 


GCTTCAGGAT 


GTGGTAGCCA 


GTTACCATGG 


TCACTTGCCC 


TCGCTAGGAC 


AGCCTTCGCA 


1861 


GGGAAATGTC 


ACAGTAGCCT 


GCATTATGTG 


GTGTGAGAAC 


TGCTAGCGCA 


TTCCTGTAGT 


1921 


TGCTTTTACG 


AAGCAGGATA 


CGCAGCGTGC 


ATCACGCGGT 


GGTTCGAGCG 


CTCGCTACGC 


1981 


ATCACAGGGC 


TGTGAGGCAA 


GTTAGTATCT 


TTGGGGGACG 


AGTTGAGAGT 


GTCAGAATCG 


2041 


ATAGTCTCAG 


GGCATGCAGG 


CGAAATGGAG 


GCTGCGCCAG 


TAGTGCCAGC 


CGGTGGCGAA 


2101 


GGCGTCAAAT 


TTACTTTTTT 


TGTTGCTGGG 


GATATTGTTA 


GAGCAACAAC 


TTGGGTCTAG 


2161 


ATGCTACTGA 


TAAAAAAAAA 


AA (SEQ ID 


NO:l) . 
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.1 CCTCGTTTTG CTTCATTCCC CGCCTTTTCT CTGTAGCTAA CCAAAGGAAC AAAGTCAGCG 
61 GTAGAAGCCG TTTCTTCTGT CCGCTTCCCA CTCTTCCCGT TCGGCTGCCC CTGCAGAGCG 
121 CCCTTTCTAT GCGTTGCCAC CCGTCTGCAA GTATCGCGTC TTTCGTCTCA TCAGTGATTT 
181 TCTTTGCGTG TCGCGTTCGG GACGCCCTTT TCTCTCCTCA ACTAACTAGC AGACGTTTCT 
241 TCCGTCCCGC ATGCGACAGC GAAGGGCACG TCCCCCCAGT TCTTCATCGC CCACCTGTTG 
301 TGCAACTTGT CGCCCGTCGT TCTTCACTTC TTCTCTCCCA TCCTCTCGTG ACTCTTCCTC 
361 TCGAGAACTC TTTCTGTCGA ACTCTCAACC CCCACGACTG CTGGTTTCGT GGCCGTCCCG 
421 CATGCACCTT GTGTCCCGCC GCCTTGGCGC AAACACCCGC TTTCTCTGCT GTCCGCCTCC 
481 CGGTGGACTT CTCTCCGTGT TTTTTCGTGT TGCCAAAAGT TTGTCTGCTT TGACGTTTCT 
541 CTGCTCACCC ATTGCCCGCT CTTGATGAGG AACGCTCCAC ATTGACAGCG AACTCACAGC 
601 ACGCACCCTC CGCGAGCGGA CTTTCACGAG CGAGGCAAGA ATCCATCGTC ACCCCGCCTA 
661 CACGTACACT ACTCCACTTG GGTGCCCACG CGCGGCTTCT GGGAGACAGA GACGGTCCTC 
721 GTTTTCCGTG TCAGAACTTT GTCGAGGAAA CGCTGCTGCT GGCAGCGGGG ATTGTGACCC 
781 CCCTCGGCGA ACGGGCGAAG CCGCCCTGTC GCGCGTCGCG ACTCAGCTGA GGCGACAGGC 
841 GGTCGGCGGC GTGACCTCTC TTTCTTTTTG CATTCGGCCC TGATTGCAGC ACGAAGGATG 
901 GAGGTCAGGG TCGGAGGCAA GTACCGACTT GGTCGGAAGA TCGGCAGCGG GTCATTCGGT 
961 GATATTTATA TCGGTGCAAA CATTTTGACG GGGGATGAGG TGGCGATCAA GTTGGAGTCT 
1021 ATCAAGTCGA AGCACCCGCA GCTGCTCTAT GAGTCGAAGC TGTACAAACT GCTGGCTGGC 
1081 GGCATTGGGA TTCCCATGGT CCACTGGTAC GGCATCGAAG GAGACTACAA TGTTATGGTT 
1141 ATCGACCTTC TCGGCCCTTC TCTGGAGGAC CTTTTCAGTA TCTGCAATCG CAAACTCTCT 
1201 CTCAAGACGG TGTTGATGCT CGCAGACCAG ATGCTCAACC GCATCGAGTT TGTCCATAGC 
1261 AAGAACTTCA TCCATCGCGA TATCAAACCC GACAACTTCC TCATTGGCCG TGGAAAGAAG 
1321 ATGTCCGTCG TCTACATCAT CGATTTCGGT TTGGCAAAGA AATATCGAGA CCCAAAGACT 
1381 CAGCAACATA TCCCATACAG GGAAGGCAAG AACCTAACAG GCACAGCGCG TTACGCTTCC 
1441 ATCAACACCC ACCTGGGGAT CGAGCAGAGT CGGCGAGACG ACCTAGAGGC GCTCGGTTAC 
1501 GTTCTCATGT ACTTCAATAG AGGTTCTCTT CCGTKGCAGG GTCTGAAGGC GACGACGAAG 
1561 AAGGACAAAT AC G ACAAGAT TATGGAGAAG AAAATGTCTA CTCCCATCGA AATTTTGTGC 
1621 AAGCATTTCC CATTCGAGTT CATCACCTAC TTGAATTACT GCCGGTCCCT GCGCTTCGAG 
1681 GATCGTCCTG ACTACGCATA CTTGCGACGC CTGTTCAAAG ACTTGTTTTT TAGAGAGGGA 
1741 TATCAGTACG ACTTCATCTT CGACTGGACT TTCATCAACA CGGAGAAGGA TCGCGCGAGT 
1801 CGAAGAAGCC AGCAAGTTTA TGTGGAAGAC AACCGGCAAG TTGAGGAGAA TCAGAACGAG 
1861 TTGCCGATGT AGGGTGGTCG GTGTGCGGAG GCCGGCGGGG AGCGTGGAGT CCGCTGAGTC 
1921 TGGAAGTCTG CAGACTGTGC TCTGGCACTC GACCCACTTG TTTGTGTTTC CCTCGACTCG 
1981 CGCAGGTCGA GGAAAACAGA GACGAACAGG TTACCCAGGA GTGTTTTTGG TCAGGACGCG 
2041 CGTCTCCCTC TGAGTTTCGC AAAGTTGCCC CTGGAA (SEQ ID NO: 3) . 
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1 TTAACCCTCA CTAAAGGGAA CAAAAGCTGG AGCTCCACCG CGGTGGCGGC GCACCGAGGA 
61 AAACGCAGCT CGTAAGAGAC AGTTCTCTCG GTGAGAAGAG CTATCCGAGA AGGACACCAT 
121 GGCGCACCAT CAAGACACCC GCAACCACAC GGGGGTCGGA CCCTCTTCGT CTATCCCTCT 
181 GAAAGATTTG AAGATCGCCG GCGTCTGGAA AATCGGCAGA AAAATCGGAT CCGGTTCCTT 
241 CGGCGACATA TACAAAGGCC TGAATTCTCA GACCGGTCAG GAGGTGGCGC TGAAGGTCGA 
301 AAGCACCAAG GCGAAGCATC CGCAGTTGCT GTACGAATAC AAACTTTTGA AGCATTTGCA 
361 GGGAGGAACG GGCATTGCTC AAGTGTTCTG TTGCGAGACT GCGGGCGACC ATAACATCAT 
421 GGCCATGGAG TTGCTCGGAC CTTCTTTAGA GGACGTCTTC AACTTGTGCA ATCGCACCTT 
481 CTCTCTCAAA ACCATTCTTC TTCTCGCCGA CCAGTTTCTG CAACGCGTCG AGTACATCCA 
541 CTCCAAGAAT TTCATTCACA GAGATATCAA ACCAGATAAC TTTCTTCTCG GCGGTGCCGG 
601 CAATCAAAAC AC GATCTACG TGATCGACTT CGGCCTGGCG AAGAAGTTTC GCGATCCGAA 
661 AACGCACCAA CATATTCCGT ACAGAGAAAA CAAGAATCTC ACGGGAACGG CGCGCTACGC 
721 GTCCATCAGT GCGCATCTGG GTTCCGAGCA GAGTCGCCGA GATGACCTCG AAGCAGTCGG 
781 CTACGTTCTC ATGTACTTCT GTCGAGGAGG CACGCTGCCT TGGCAGGGCA TCAAAGCGAA 
841 TACCAAACAG G AG AAGT AC C ACAAGATCAT GGAGAAGAAG ATGTCGACGC CCGTCGAGGT 
901 GCTATGCAAG GGATATCCAA GCGAATTTGC CACATACTTG CACTACTGCC GCTCCTTGCG 
961 ATTCGAGGAC CGACCGGACT ACGCCTACCT CAAGCGACTC TTTCGAGATC TCTACATCAA 
1021 AGAGGGCTAC GATGACAGTG ACCGCGAATT CGACTGGACA GTGAAACTTT CGTCGCGCAG 
1081 TCTCGGACCG CCAAGCAGTC GAGCGCAACA TGTTTTACTG AGTCAAGACA CCCGAACGCG 
1141 AGGGAAGCGG GAGACAGATC GACCTGTCGC TGCGCGGAGT GGCGACCGCG AACGAGGAAT 
1201 CCATTTCAGC AACGGGAACG TGGGCAATCC TTCGATGGCA ACGAACCCCG GCGGCCTGTC 
1261 AGTCATGGTG CATGAACGCA CGAGTCTGGT GGATCAGGGA GACCGTGGGT CGCGCGAAAC 
1321 TTCTACGCGG AAAGAAGACG CGAAGGACGG CAGATGGCCA GGAGGCAGAT TTTCTTGTCT 
1381 TCCACTGTTA TGTCGGCGCT CTCCGACGAA GGCCTAGATG AACTGCGGAG GCGCTCCTGT 
1441 CCCCGCAGTT GGCATCTCTC TCCTTCATTG TCGTTGTTCC CCTGCAACTC GAGTCCACCC 
1501 TTGACATCCT CGTCTCTCTC TTCCTGTCGG TTTCCTCTTT CTCGTCCTCT CCCCCCTAGC 
1561 TTCGTTCTCT C CTTTCTATC CTGCTTCGGC GTCGCCTCAC TTCTCTCCTC ACTTCTCTCC 
1621 CTTTTGTTTT TCTTCGCGGC GTCTCTCCTT CACTCTGTCT CCGCCTCTGA CGCCGCGCGG 
1681 GAGCCGTTTC CTGCAGGCAG CTCAGGCAAT ACCTGCCTGC AGGTGCCTCT CCTTTTTGAG 
1741 CGTCTCTCTT TCCTCGTCGA AACGGTCCTC ACAGCTTCCT CTCCCTGGGG ACGCCGTGGG 
1801 CGTAAGTTCT TTTTTTGACG GTCCCGGTGG GCTGGCGTTG TTCGCCTGCC TTCCGCGCAT 
1861 GCACTCCGAG CATTTTTGCC TGGCCTGGAC TTCTCCGAGC GAGAGTTGCG GTTTGGCTTC 
1921 TGCATCGTCT CCTGCGCTGC TTTCATTTCT CTAGGTTTCT GCTTGCGGCC TCCGTGTACA 
1981 GAAATCGGAA GGTGAAGGCG TAGTGGCCAG AGAACGAAGC AAACGAGAGA ACCACGTTCC 
2041 ACCTTGTGCG CACGCATGCA TCTACGCATG CACGGTATTT AAGCCGATTT TTTGTGTATG 
2101 TATATAGATG TATATATATA TGTATCTACA TGTATCTACC TATATATATG TGTGTGTGTA 
2161 AGTGGAAGTG TATTTTTGCA TGTGCAGAAA GCTTTCTTTT CCGCTGGCAT GCTGGAAGAA 
2221 GGGCAGGAGG CGACGATCCT GCGAGTCAGG GCGTTCCCTT GTTTCCAGTG AGTTAACCGA 
2281 ATTGTTTATT GATATGCGTT TGCATGCATC GACAATGGAT CCTAGACACG CCCGTTTAAA 
2341 ATCAGAGGTA TTCCTAAAAA AAAAAAAAAA AAA (SEQ ID NO: 5) . 
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TgCKIp (SEQ ID NO: 6) 
TgCKia (SEQ ID NO: 4) 
EtCKia (SEQ ID NO: 2) 
PfCKla (SEQ ID NO: 21) 
LmCKI-2 (SEQ ID NO: 22) 
TCCKI-2 (SEQ ID NO: 23) 

TgCKIp (SEQ ID NO: 6) 



1 CKlfl-Nt • * 

(1) |MAHT?QDTRNHTGVGPSSSIPLKfc Lj^AGVWKIGRKIGSGSro 

(1 ) meV RVGGKYRLGRKIGSGSFX3DIYIGANILTGDEVAIKLBSIKSKHPQLLYESK 

(1 ) MDV RVGGKYRLGRKIGSGSFGDIYLGTN1STGDEVAIKLESVRSRHPQLIYESK 

(1) MEI RVANKYALGKK IXSSGSFGDIWA KDIVT>BEFAVKLESTRSKHPQLI < YE SK 

(1) _ MNVEI RVGNRYRIGQKtrGSGSFGHIF^GTNIQTGDPVAIKLEQVKTRHPQLTYESR 

(1) MSLEI RVGNRFRLGQKIGAGSFGEIFRGTNIQTGETVAIKLEQAKTRHPQLALEAR 

_** * ******** * ***** ** *** **** ** 

76 150 

( 76 ) LLKHLQGG TGIAQVFCCTTAGDHNIMAMELLCPSLEDVFNIX^^ 

( 55 ) LYKLLAGG IGIPMVTIWYGIEGDYNVMVIDLLGPSLEDLFSICNRKLSLK^ 

( 55 ) LYKILTGG IGIPTLYWYGIEGDYNVMIIELIX3PSLEDLFSI 

(55) LYKI LGGG IGVPKVYWYGIEGDFTIMVLD1>LGPSLEDLFTLCNRKFSI^ 

( 57 ) FYRILGSGGGAVGIPMMFYHGVEGEFNVMVIELIXSPSL^ 

{ 57 ) FYRILNAGGGWGIPNILFYGVEGEFNVMVMDLLGPSI^ 

it * * *** * ** * * ********** * * **** ******* *** **** ** 

151 CKla-It 225 

( 148 ) PJJIKPPNFLU&AGNQNTIYVIDFGLAKKrRDPKT HQ^^ 
(127) ITOIKPDNFLIGRGKKMSWYIIDreiAKKYRDPKT q^HIPYRE^ 
(127) RDIKPDNFLIGRGKIOISIVYAIDFGIAKKYRDPRTQS^ 
( 127 ) RDIKPDNFLIGRGKKVT lLIHIIDFGLAKk YRDSRSHTHIPYKEGKNLT^ 

(132) RDIKPDNFLMGTG KKGHHVYIIDFGLAKK YRDPRTHAHI PYKEGKSLTGTARYCSINTHMGVEQGRRDDMEGIGY 

( 13 2 ) RDMKPDNFLMGTG K^ColmvyVVI^GLAKi KYRDPR 

********* * *** ************ * **** *** ******* ******* ******* * ** 

226 300 
(223 ) VIiMYFCRGGTLPWQGIKANTKQEKYHKIMEKKMST PVEVLCKGY PS EFATYLHYCR SLRFEDRPDYAYLKRLFRD 
( 2 02 ) VLMYFNRG- S LPWOGLKATTKKDKYDKIMEKKMSTPIEIIXTKHFPFEFITYI^ 

(202) vi^fnrg-slpwqglkattkkdkydkimekkmstpie^ 

(202 ) vl-wyflrg-slpwqglkaiskkdkydkimekkists 

( 2 07 ) ii^plrg-slpwqglkahtkqekynriserkqttpvellckgfpsefaaymnyvra^ 

(207) ilmyflrg - s lpwcg1jcahtkqekysriserkqttpvetlckgf paefaay lnyirs lrfedkpdysylkrlfre 

**** ** ********* ** ** * * * ** * *** * ** **** ******* *** ** *** * 

301 _ CKta-Ct 375 

(298) LYIKEGYDDSDREFDWT /KLS SRSLGP PS SRR.QHVLLSQDTRTRGKJIETDRPVAARSGDRERGIHFSNGNVGNPS 

(276) LFFREGYQY-DFIFDWI FINTBKDRAS RfcSQQyYV^DNRQVEENQNELPM! 

(276) LFFREGYQY- DFI FDWI FLHAER£RERQRRSMVNQGAESGNQWRRDASGRDPLGRLPQLEP 

(276) LFIREGFTY-DFLFDWI CVYASEKDKKKMLENKNRFDQTADQEGKDQRNN 

(281) LFVREGYHV- DYVFDWT jKRIHESLQEQQSFPGGSNGGGAAGNGS PVNQS PAQGGNGGAPNSANNQE SGAPEQQ 

(281) LFIREGYHV-DYVFDWr jKRIHIWLKAEGSGQ^EQKQQ^QQRERGDVEQA 
** **** * * * * * 

376 CKlfl-Ct 435 

(373 ) MA.TNPGGL SV>IVHERTSLVDQGPRG S;RETSTRKEDAKDGRWPGGRF 3) C LPLICRRS PTKA 



Brackets mark the catalytic core region 
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MDV RVGGKYR LGRK IGSGSF GDIYLGTNIS TGDEVAIKLE 
SVRSRHP QL- C : YgS5KLYKILT GGIG IPTLYW YGIEG DYNVM 
IIELLGPSLE DLFS I CNRKL SLK^VLMIAP ,: QMtiNBjlEFVH 
SRHFIHRDIK PDNPiiQRGK KMS I VFAIDF GLAKKYRD PR 
TQ^^^EGl^T^ 

YVixMYFNRGS L PWQGLKATT KKDKYDK I ME KKMSTPIEVL 
CKQFPFEFIT YLNYC RS LRjFv. EiDRP^If gyijRi RLFKOIifl^E 
GYQYDFIFDW TFLHAERERE RQRRSMVNQG AESGNQWRRD 
ASGRDPLGRL PQLEP (SEQ ID NO: 2) 
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